Document Layout Analysis


Document layout analysis (DLA) is the process of analyzing a document's spatial arrangement of content to understand its structure and layout. This includes identifying the location of text, tables, images, and other elements as well as the overall structure, such as headings and subheadings. DLA helps in extracting and categorizing information and automating document processing workflows.

AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization

Add code
Mar 28, 2025
Viaarxiv icon

SFDLA: Source-Free Document Layout Analysis

Add code
Mar 24, 2025
Viaarxiv icon

PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction

Add code
Mar 21, 2025
Viaarxiv icon

UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis

Add code
Mar 20, 2025
Viaarxiv icon

An Efficient Deep Learning-Based Approach to Automating Invoice Document Validation

Add code
Mar 15, 2025
Viaarxiv icon

TextBite: A Historical Czech Document Dataset for Logical Page Segmentation

Add code
Mar 20, 2025
Viaarxiv icon

MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures

Add code
Mar 20, 2025
Viaarxiv icon

EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation

Add code
Feb 23, 2025
Viaarxiv icon

OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models

Add code
Feb 22, 2025
Viaarxiv icon

Qwen2.5-VL Technical Report

Add code
Feb 19, 2025
Viaarxiv icon